DNA Sequence Classification in the Presence of Sequencing Errors: Project of CSE847
Abstract
We propose a new method for DNA sequence classification in the presence of sequencing errors. The method incorporates sequencing error models into the standard Viterbi algorithm (AJ, 1967), which finds the most probable path by which a profile hidden Markov model (profile HMM) (SR., 1998) generates a query DNA sequence. It corrects the sequencing errors and then aligns the corrected query sequence to different protein families. By correcting sequencing errors, the method will improve prediction accuracy for sequence classification. It will also assign to one of the candidate families more of the sequences that current tools cannot classify because of sequencing errors.
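The general idea of folding an error model into Viterbi decoding can be illustrated with a minimal sketch. It is not the method proposed here: it uses a plain two-state HMM rather than a profile HMM, and it assumes a substitution-only error model with a single hypothetical rate `eps` (a miscalled base is equally likely to be any of the other three). The names `noisy_emission`, `viterbi`, and the toy parameters are all illustrative assumptions.

```python
import math

ALPHABET = "ACGT"


def noisy_emission(emit, eps):
    """Fold an ASSUMED substitution-only error model into the emissions.

    With probability 1 - eps the true base is read correctly; otherwise it
    is replaced by one of the other three bases, each with probability eps/3.
    """
    return {
        s: {
            y: sum(dist[x] * ((1 - eps) if x == y else eps / 3) for x in ALPHABET)
            for y in ALPHABET
        }
        for s, dist in emit.items()
    }


def viterbi(obs, states, start, trans, emit):
    """Return (log probability, state path) of the best path for obs."""

    def log(p):
        return math.log(p) if p > 0 else float("-inf")

    # score[t][s]: best log probability of any path ending in state s at t
    score = [{s: log(start[s]) + log(emit[s][obs[0]]) for s in states}]
    back = [{}]
    for t in range(1, len(obs)):
        score.append({})
        back.append({})
        for s in states:
            prev = max(states, key=lambda r: score[t - 1][r] + log(trans[r][s]))
            score[t][s] = score[t - 1][prev] + log(trans[prev][s]) + log(emit[s][obs[t]])
            back[t][s] = prev
    # Trace back from the best final state
    last = max(states, key=lambda s: score[-1][s])
    path = [last]
    for t in range(len(obs) - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()
    return score[-1][last], path


# Toy model (hypothetical parameters, not taken from any real protein family)
states = ["AT_rich", "GC_rich"]
start = {"AT_rich": 0.5, "GC_rich": 0.5}
trans = {
    "AT_rich": {"AT_rich": 0.9, "GC_rich": 0.1},
    "GC_rich": {"AT_rich": 0.1, "GC_rich": 0.9},
}
emit = {
    "AT_rich": {"A": 0.4, "C": 0.1, "G": 0.1, "T": 0.4},
    "GC_rich": {"A": 0.1, "C": 0.4, "G": 0.4, "T": 0.1},
}

log_p, path = viterbi("AATTGGCC", states, start, trans, noisy_emission(emit, 0.05))
print(log_p, path)
```

Under this assumption, a read containing a base that the error-free model gives zero probability still receives a finite score, which is one way a sequence that would otherwise be rejected could still be assigned to a family. Insertion and deletion errors, which shift the reading frame, would need extra states or transitions and are not covered by this sketch.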